Large-Scale Discrete Fourier Transform on TPUs

نویسندگان

چکیده

In this work, we present two parallel algorithms for the large-scale discrete Fourier transform (DFT) on Tensor Processing Unit (TPU) clusters. The are associated with DFT formulations: one formulation, denoted as KDFT, is based Kronecker product; other famous Cooley-Tukey algorithm and phase adjustment, FFT. Both KDFT FFT formulations take full advantage of TPU's strength in matrix multiplications. formulation allows direct use nonuniform inputs without additional step. algorithms, same strategy data decomposition applied to input data. Through decomposition, dense multiplications kept local within TPU cores, which can be performed completely parallel. communication among cores achieved through one-shuffle scheme both sending receiving takes place simultaneously between neighboring along direction interconnect network. designed topology clusters, minimizing time required by cores. implemented TensorFlow. three-dimensional complex an example dimension 8192 ×8192 a Pod: run 12.66 seconds that 8.3 seconds. Scaling analysis provided demonstrate high efficiency implementations TPUs.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Discrete Fourier Transform on Multicores

This paper gives an overview on the techniques needed to implement the discrete Fourier transform (DFT) efficiently on current multicore systems. The focus is on Intel compatible multicores but we also discuss the IBM Cell, and briefly, graphics processing units (GPUs). The performance optimization is broken down into three key challenges: parallelization, vectorization, and memory hierarchy op...

متن کامل

Dft : Discrete Fourier Transform

A. Table of contents by sections: 1. Abstract (you’re reading this now) 2. Summary of the DFT (How do I do the homework?) 3. Review of continuous-time Fourier series 4. Bandlimited signals and finite Fourier series 5. Sampling theorem for periodic signals 6. Review of quirks of discrete-time frequency 7. Orthogonality and its significance 8. Discrete Fourier Transform (DFT) 9. Use of DFT to com...

متن کامل

The Discrete Fourier Transform∗

1 Motivation We want to numerically approximate coefficients in a Fourier series. The first step is to see how the trapezoidal rule applies when numerically computing the integral (2π) −1 2π 0 F (t)dt, where F (t) is a continuous, 2π-periodic function. Applying the trapezoidal rule with the stepsize taken to be h = 2π/n for some integer n ≥ 1 results in (2π) −1 2π 0 F (t)dt ≈ 1 n n−1 j=0 Y j , ...

متن کامل

The Discrete Fourier Transform

Disclaimer: These notes are intended to be an accessible introduction to the subject, with no pretense at completeness. In general, you can find more thorough discussions in Oppenheim's book. Please let me know if you find any typos. In this lecture, we discuss the Discrete Fourier Transform (DFT), which is a fourier representation for finite length signals. The main practical importance of thi...

متن کامل

Discrete Fourier Transform

This note provides a brief review of the Fourier transform for the analysis of discretetime signals and systems and a description of practical assignments, which will be performed on a Texas Instrument DSP board. Some tasks are to be performed with the help of MATLAB. It is assumed that the student has enough theoretical background to perform the practical part of the work. The review of some a...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Access

سال: 2021

ISSN: ['2169-3536']

DOI: https://doi.org/10.1109/access.2021.3092312